Docket No. : YOR9200 1 0454US 1 



IN THE CLAIMS: 

Please amend the claims as indicated below: 

1. (Currently Amended) A method for transcribing speech of a plurality of 
5 speakers, comprising: 

providing said speech to a plurality of speech decoders, each of said 
decoders using a speaker model corresponding to a different fe* one of said speakers and 
generating a confidence score for each decoded output; and 

selecting a decoded output based on said confidence score. 

10 

2. (Original) The method of claim 1, further comprising the step of aligning 
each of said decoded outputs in time. 

3. (Original) The method of claim 1, wherein one or more of said speech 
1 5 decoders are on a remote server. 

4. (Original) The method of claim 1, further comprising the step of 
presenting said selected decoded output to a user. 

20 5. (Original) The method of claim 1 ? further comprising the step of manually 

selecting an alternate decoded output if said assigned output is incorrect. 

6. (Original) The method of claim 5, further comprising the step of adapting 
said selecting step based on said manual selection. 

25 

7. (Original) The method of claim 1, further comprising the step of 
presenting several decoded outputs to a user with an indication of said corresponding 
confidence score. 

30 8. (Original) The method of claim 1, further comprising the step of 

presenting said decoded output as a string of words if said corresponding confidence 
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score exceeds a certain threshold and as a string of phones if said corresponding 
confidence score is below a certain threshold. 

9. (Original) The method of claim 1 5 further comprising the step of 
5 presenting said decoded output as a string of words for the decoded output having the 

highest confidence score and as phones or syllables for all other decoded outputs. 

10. (Original) The method of claim 1, wherein said selecting step further 
comprises the step of determining if a decoded output includes an isolated word from a 

10 second speaker in a string of words from a first speaker. 

1 1 . (Currently Amended) A method for transcribing speech of a plurality of 
speakers, comprising: 

providing said speech to a speaker independent speech recognition system 
15 and a speaker specific speech recognition system substantially simultaneously ; and 

decoding said speech using said speaker independent speech recognition 
system whenever the identity of the current speaker is unknown. 

12. (Original) The method of claim 1 1, wherein said decoding step continues 
20 until a speaker identification system identifies an unknown speaker. 

13. (Original) The method of claim 11, wherein one or more of said speaker 
independent speech recognition system and said speaker specific speech recognition 
system are on a remote server. 

25 

14. (Original) The method of claim 11, further comprising the step of 
presenting said selected decoded output to a user. 

15. (Currently Amended) A method for transcribing speech of a plurality of 
30 speakers, comprising: 
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providing said speech to a speaker independent speech recognition system 
and a speaker specific speech recognition system substantially simultaneously ; and 

decoding said speech using said speaker specific speech recognition 
system with a speaker model for an identified speaker until there is a speaker change. 

5 

16. (Original) The method of claim 15, further comprising the step of 

decoding said speech using a speaker independent speech recognition system until the 
identity of a speaker is determined and the appropriate speaker model is loaded. 

10 17. (Original) The method of claim 15, wherein one or more of said speaker 

independent speech recognition system and said speaker specific speech recognition 
system are on a remote server. 

18. (Original) The method of claim 15, further comprising the step of 
1 5 presenting said selected decoded output to a user. 

19. (Currently Amended) A system for transcribing speech of a plurality of 
speakers, comprising: 

a memory that stores computer-readable code; and 
20 a processor operatively coupled to said memory, said processor configured 

to implement said computer-readable code, said computer-readable code configured to: 

provide said speech to a plurality of speech decoders, each of said 
decoders using a speaker model corresponding to a different for one of said speakers and 
generating a confidence score for each decoded output; and 



25 select a decoded output having a highest confidence score. 

20. (Original) The system of claim 19, wherein said processor is further 

configured to align each of said decoded outputs in time. 

30 21. (Original) The system of claim 19, wherein one or more of said speech 



decoders are on a remote server. 
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22. (Original) The system of claim 19, wherein said processor is further 
configured to present said selected decoded output to a user. 

23. (Currently Amended) A system for transcribing speech of a plurality of 
5 speakers, comprising: 

a memory that stores computer-readable code; and 

a processor operatively coupled to said memory, said processor configured 
to implement said computer-readable code, said computer-readable code configured to: 

provide said speech to a speaker independent speech recognition system 
10 and a speaker specific speech recognition system substantially simultaneously : and 

decode said speech using said speaker independent speech recognition 
system whenever the identity of the current speaker is unknown. 

24. (Original) The system of claim 23, wherein said processor performs said 
15 decoding until a speaker identification system identifies an unknown speaker. 

25. (Original) The system of claim 23, wherein one or more of said speaker 
independent speech recognition system and said speaker specific speech recognition 
system are on a remote server. 

20 

26. (Original) The system of claim 23, wherein said processor is further 
configured to present said selected decoded output to a user. 

27. (Currently Amended) A system for transcribing speech of a plurality of 
25 speakers, comprising: 

a memory that stores computer-readable code; and 

a processor operatively coupled to said memory, said processor configured 
to implement said computer-readable code, said computer-readable code configured to: 

provide said speech to a speaker independent speech recognition system 
30 and a speaker specific speech recognition system substantially simultaneously : and 
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decode said speech using said speaker specific speech recognition system 
with a speaker model for an identified speaker until there is a speaker change. 

28. (Original) The system of claim 27, wherein said processor is further 
5 configured to decode said speech using a speaker independent speech recognition system 

until the identity of a speaker is determined and the appropriate speaker model is loaded. 

29. (Original) The system of claim 27, wherein one or more of said speaker 
independent speech recognition system and said speaker specific speech recognition 

10 system are on a remote server. 

30. (Original) The system of claim 27, wherein said processor is further 
configured to present said selected decoded output to a user. 

15 31. (Currently Amended) An article of manufacture for transcribing speech of 

a plurality of speakers, comprising: 

a computer readable medium having computer readable code means 
embodied thereon, said computer readable program code means comprising: 

a step to provide said speech to a plurality of speech decoders, each of said 
20 decoders using a speaker model corresponding to a different for one of said speakers and 
generating a confidence score for each decoded output; and 

a step to select a decoded output having a highest confidence score. 

32. (Currently Amended) An article of manufacture for transcribing speech of 

25 a plurality of speakers, comprising: 

a computer readable medium having computer readable code means 
embodied thereon, said computer readable program code means comprising: 

a step to provide said speech to a speaker independent speech recognition 
system and a speaker specific speech recognition system substantially simultaneously : 
30 and 
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a step to decode said speech using said speaker independent speech 
recognition system whenever the identity of the current speaker is unknown. 

33. (Currently Amended) An article of manufacture for transcribing speech of 

a plurality of speakers, comprising: 

a computer readable medium having computer readable code means 
embodied thereon, said computer readable program code means comprising: 

a step to provide said speech to a speaker independent speech recognition 
system and a speaker specific speech recognition system substantially simultaneously ; 
and 

a step to decode said speech using said speaker specific speech recognition 
system with a speaker model for an identified speaker until there is a speaker change. 
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